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ABSTRACT 


The general linear model of statistical inference is 
formulated in terms of the Moore-Penrose generalized inverse. 
The matrix algebra of the generalized inverse which is es- 
sential to the model is presented. A methodology for estima- 


tion and hypothesis testing is derived which permits identical 


manipulation of both the ful op the less-than-full rank 


cases of the model. 
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i tVERODUCTLON 

The linear statistical model is a mathematical formula- 
tion which is useful in the interpretation of data and ob- 
servations from experiments. While the assumptions necessary 
for the use of a linear statistical model restrict its appli- 
cation, it is frequently convenient and appropriate to use 
such a model because of its simplicity and its suitability 
to many processes and experiments under investigation. 

This paper will be concerned with one of the most fre- 
quently utilized linear statistical models. It is assumed 
Eniaten observations, Yay are made of a process or experimen- 
tal quantity. The process or experiment has p elements, Aa 
each of which has a fixed value for each of n replications. 
Associated with the vector of observations is a vector of 
errors, denoted by e. The errors are assumed to be uncor- 
related between replications and have a multivariate distri- 
bution with mean vector 0O and variance-covariance matrix o“t. 
In matrix notation, the model is expressed as 

Y= XB +e , 
where Y is ann x 1 vector of observations with mean vector 
X8 and variance-covariance matrix saeinp 4 is-an n x DO Matuix 
Of known constants, and 8 1s a»p x 1 vector of unknown paxzam- 
eters. 5? is the unknown variance of the individual observa- 
eLons. 

By the use of the general linear statistical model, one 
Can estimate the assumed functional relationship between Y 
and X. If the matrix X is of full rank, we accomplish this 


by estimation of 5° and 8. In the less-than-full rank case, 


we concern ourselves with estimation of a? and F’'8, sets of 
linearly independent estimable functions. Several tests of 
hypotheses are frequently useful in the evaluation 

of the functional relationships in the model. Such tests pro- 
vide conclusions concerning the values of the unknown param- 
eters which would be useful in predicting values of Y or in 
explaining the variability of Y. 

This paper discusses the mathematical manipulation of 
a linear model by which conclusions regarding the process 
under investigation might be reached. The general linear 
statistical model has traditionally been treated as two sep- 
arate cases, with two different methodologies. These sep- 
arate methodologies are necessitated by two general forms 
which the matrix X may assume. These forms are the full rank 
form, where the n x p matrix X is of rank p, and the less- 
than-full rank form where X is of rank r < p. The applica- 
tion of the general linear model to experimental design fre- 
quently gives us the less-than-full rank case, whereas in the 
regression analysis application, the full rank case is most 
often encountered. 

As an example of the apparent necessity for different 
methodologies, consider estimation of the vector of unknown 
parameters. The method of least squares yields the set of 
equations, 

X'XB = X'Y , 
where ® is the least Squares estimator of the vector 8. If 
Xyes of fiadd wank, a unique solution for g exists, because 


X'X has an inverse. If, however, the rank of X is r < p, a 


unigue solution does not exist, and reparametrization is com- 
monly used to estimate the invariant features of the solution 
vectors, B. Reparametrization involves the linear trans forma- 
tion of the vector 8 into the vector a and the consequent 
change of the model from 


v XB +e 


ice 

Ye=as4a4aqtrte , 
where Z is a matrix of full rank. The mathematical manipula- 
tion of this model can then be carried out in much the same 
fashion as the manipulation of the full rank case. The lack 
of a unified methodology led the author to consider an ap- 
proach by which both the full rank and the less-than-full rank 
cases could be manipulated by the same mathematical technique. 

In the chapters to follow, manipulations of the general 
linear statistical model will be formulated in terms of the 
Moore-Penrose generalized inverse. The definition of the 
generalized inverse will be taken as the point of departure 
for Chapter II. A survey of the properties of the generalized 
inverse which are preliminary to the statistical formulation 
will be presented. 

In Chapter III the estimation of parameters and functions 
of parameters, and the distribution of the estimates will be 
discussed. The formulation and testing of hypotheses will 
be examined in Chapter IV. For a part of tne third chapter 
and the entire fourth chapter it will be necessary to specify 
a multivariate distribution for the vector of errors. We 


shall assume that e has the multivariate normal distribution; 


that is, eUMVN(0,0°I). This is a commonly assumed distribu- 


tion, as it is justifiable for observational errors ina 
wide range of processes and experiments. 

Appendix A is composed of a Simple numerical example of 
the unified methodology derived in the preceding chapters. 

An experimental design application of the general linear sta- 
tistical model has been chosen to illustrate the use of the 
method in the less-than-full rank case. 

Throughout the development, it will be noted that strong 
reliance has been placed upon the notation, terminology, and 
methods of proof of Graybill (1). This is due, in part, to 
Graybill's consistency with the generalized inverse formula- 
tion. Equally important, however, is the simplicity with 


which Graybill has presented the basic statistical theory. 


Il. THE GENERALIZED INVERSE 

The mathematical concept known as the generalized in- 
verse of a matrix was first introduced by E. H. Moore in 1920. 
Moore (2) presented the first published systematic investiga- 
tion of the properties of the generalized inverse in 1935. 
The concept was not widely recognized until 1955 when R. A. 
Penrose (3) independently rediscovered it. 

The Moore-Penrose generalized inverse of an arbitrary 


matrix A has been defined by Penrose (3) as the solution, 


G = A’ of the following four equations. 
AGA = A (21) 

GAG = G (25.2 ) 

(AG)' = AG (cl) 

(GA)' = GA (22) 


It is convenient to describe the generalized inverse of 
an arbitrary matrix by two equations which are equivalent to 
the defining equations. These equations can be formed by 


muostituting eqn. (223) into eqn 2s2z), and eqn (2.4) sinte 


eqn. (2.1). Respectively, these relationships are 
GG'A' =G CZs) 

and 
AA'G' =A (2.6) 


There are several properties of the generalized inverse 
which will be useful in developing the statistical aspects. 
of this paper. These properties are stated below with source 
references, when appropriate, where a detailed development 


of the property may be found. 


Property (1). The generalized inverse specified by 


the four defining equations is unique. [Penrose (3) ] 
++ 


PrODe TEV aliZ) eek =e [Penrose (3) ] 
PxOpe rE ae). pies = ae [Penrose (3) ] 


Property (4). ae = A'at Nee [Penrose (3) ] 


Rope sey eel5 a ene AA’, Ta Ay: and I-AA* are each 
idempotent. [Penrose (3) ] 


Bree nb 1(6) .seRank (AT) Sete nike Adee mank (Ata) = 


rank (AA) = trace (ATA). [Penrose (3) ] 
BrOpeiWiey a /). (AB)* = B’AT df peecGl Gmina ea litey, ATA 


and BB' commute, and A'A and BB’ commute. [Greville (4) ] 
Property (8). A necessary and sufficient condition 
that AXB = C has a solution for X is AA’CB'B = C. 

; ; ++ -- “ 
The general solution is X = A CB + 2Z- A AZBB , where 
Zis arbitrary. [Penrose (3) ] 


PrOpemew7). Lf A is an nm x p matrix of rank ope 


N01) | NUN ae 
Proof: (A'A) has an inverse, for it is a p xX p matrix 
of rank p. It may easily be verified that (ata) + satisfies 


the defining equations for the generalized inverse of (A'A). 
As the generalized inverse is unique, (A'A)~ = Gos) 
Property alO)ieen 1S atl nm x p Matrix Of eon, 
AA = I. 
Proof: We know from matrix theory that if A is an 
n X p matrix of rank p, then A has a left inverse. It can 
be shown by substitution into the defining equations that 
the left inverse if A’. The right inverse result for a 


p Xn matrix of rank p is, similarly, AAT =e 


10 


ErOperty (ll).  _L£ Seis Seo Fr Fl ber onk and 


iS partitioned such Ube (A, A.) then 
| _+ + + +] 
Ar = : fetenie (5) 
+ + 
In particular, it is noted that when A, ‘A, = Q, then 
+ [i+ 
A = Ay 
+ 
A» ; 
+ - ot aa 7 
since AJA, _ (AJA, A, )A, = Q. 


thal 


et. eo LP MAT TON 

Making the assumptions for the general linear statis- 

tical model, 
Y = X€6 + e, 

where Ete m= Owand HE (ee!) .= er, provides us with a basis for 
estimation of the vector of unknown parameters and functions 
of the parameters. Estimation of these quantities is equiv- 
alent to estimation of the relationships which exist between 
the observations, Y, and the matrix of known constants, X. 

As previously indicated, using the method of least 
Squares to minimize (Y-X8)'(Y-X8) yields the normal equations, 

X'XB = X'Y. 
Regardless of the rank of X, properties (4) and (8) give us 
a general solution for the least squares estimator of 86, 
B= (x'X)*x'y + [I-(X'X)*(x'X)]Z = X*Y + (I-X"X)Z, 
where Z is an arbitrary p x 1 vector. It is apparent that 
if X is of full rank, then by properties (9) and (10), 
6 = xty = (x'x) ¢x'y, 

which confirms the result derived in Chapter I. However, we 
will not set the full rank case apart, for the general solu- 
tion holds for rank (X) = r < p. 

Least squares estimation does not directly provide an 
estimate of cae However, by basing our estimate of 5? on B, 


ee 
an unbiased estimate of o 1S 


-2 _ (¥-xB)' (y-x8) _ Y' (I-XX") Y 
7 n- er — n- rx 


Of immediate interest is the mean vector and variance- 


covariance matrix of the estimator 8. Recalling that Y has 


2 





mean vector X8& and variance-covariance matrix gcse 
E(B) = E[X’y+(I-x"x)Z] = X X8 + [I-xX"°x]2 
and 
var (8) = El (x*y-x*xs) (x*ty-x*xg)') = E[xtee'x''] = 07% (x'x)?, 
LE is noted tnae Q is not, in general, an unbiased estimator. 
However, if the vector 8 happens to equal our choice of the 
arbitrary vector Z, the estimator B is unbiased. 

If we now specify the distribution of the vector of 
errors such that e UMVN(0,0°I) , it can be shown that the 
method of maximum likelihood estimation leads to precisely 
the same results as the least squares estimation of 8. 

By assuming that E(e) = 0 and E(ee') = o“v, where V is 
an arbitrary positive definite matrix, the estimation can be 
somewhat generalized. The normal equations stemming from 
least squares estimation in this more general case are, 

(x'vtx) 8 = x'v74y, 


with solutions, 


al fi 


B= (xiv txytxty y + [r-(x'v tx) *¢x'v74x) Jz. 
Furthermore, 
E(p)ee (xtvotxy Tt oxty7txye 4 [r-¢x' v7 Ex) t (x! v7? x) ]z 
var (B) = E[(B-E(8)) (6-E(8)) '] 
= E[(X'V 7x) *x'v tee! vi tx (xiv tx) 7] 
= (xiv txytyty7tyy (xt v7txy te? = o*(x'y ¢x)*, 


In each of the distributional cases discussed above, the 
Sseimator Of SB 2omMeteunaqie Tireaeme rank (X) = © <>. ene 
less-than-full rank case no direct inference can be made con- 


cerning all the values of the elements of 8. However, there 


is 


are invariant quantities in any solution which are called 
estimable functions. 

Consider a linear combination of the elements of 8, de- 
noted by £'8. £'8 is invariant with respect to the arbitrary 
Jef, AAC On eye If, 

ape) = te 
If X is of full rank, then this relationship holds for all 
f'. In the less-than-full rank case, all linear combina- 
tions of the form where f' = b'X have this property since 


Ft (Tea el — silt (aves a al ea 


Furthermore, since the nullity of bee is r, any basis for 
the null space of [I-x" x] is a set of r linearly indenendent 
solutions to £' [I-x"x] = 0. Since the rank of X is r and 
Soar se = Q, all such vectors f' can be written f' = b'X. 
We may define a linearly estimable function in terms of this 
invariance property; that is, a linear function of the param- 
eters, £'8, 1s estimable if, and only if, £' (I-x'X) = 0. 
Graybill (1) defines a linearly estimable function as a 
function of the unknown parameters for which there exists a 
Veccor be such = that 

E(b'y) = b'X® = £'8B. 
Clearly, the two definitions are equivalent. Graybill fur- 
thermore proves that it is equivalent to say that a linearly 
estimable function is a linear combination of the parameters 


for which a solution for r exists to the equation 


Ti,is this form of the estimable function which will be wmese 


useful to us. 
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Among other results which Graybill has developed with 
applicability in the generalized inverse formulation are 


the three listed below: 


(l). The best linear unbiased estimator of van 
estimable f£Uncrien = 6. .)1s 

we . 

fap = Gt s 
(2) “The funecevens, £8, £58, ae £8 are called 


linearly independent estimable functions if each 

of the functions is linearly estimable, and fi, 

fo, er fn are linearly independent vectors. 

(3). A set of linearly independent estimable 

functions contains, at most, r functions, where 

r= rank 

A linearly estimable matrix function, F'8, iS a set of 
imelinearly pindependenwaestamable fumetions, 1 < m < see 


matrix function may be represented as 


F'B = | | (8) = (R'X'X)8 


ee 
m 
where the £.8 are linearly independent estimable functions. 
A result which will be frequently utilized is that the 
estimator of an estimable matrix function is invariant with 
respect to the arbitrary matrix Z, which follows directly from 


the fact that each component of F'® has this propeniw. 


dls. 


Considering the distributional case where eVMVN(0,0°I), 


the estimator of an estimable matrix function is always 

unique and, therefore, of use in making inferences regarding 

the values of the estimable functions. The mean vector and 

variance-covariance matrix of our estimator are, respectively, 
BE(F'@) = E(F'X’Y) = R'X'(XX'X)8 = F'8 

and 


Var (F)aieso Rul Ee ae) Gn v= 8) ee Rip ce taku), = 0° mee 
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lV. ~ THs Tener ye ore ec 
We shall now discuss a test of the hypothesis F'8§ = C, 
where F'8 is anm x 1 vector of linearly independent estim- 
able functions. F'8 = C is called a linearly estimable 
hypothesis if F'8 is a set of linearly independent estimable 
functions and C is a vector of known constants. To illustrate 
this concept, suppose we desire to test the equality of the 


First three components of 8; that is, 


iteche functions 


£8 = (1 =1 0 Omseee0)e8 
and 
£8 = (1,1), —240: +20) 6 
are linearly estimable, then 
F'B = (f£,,£5)'B = = 0 


im, and only Tisec. — 8, = B,. We require that the linear 


ik 
estimable functions of the hypothesis be independent. To 
have any of these not independent would be redundant. For 
example, in the illustration above, 
£38 = (0 1 -1 0*°:0)8 = 0 

would test the equality of 8. and B 3, which is already being 
tested by £8 and £58. 

In the full rank case, a consequence of the form of the 
linearly estimable function and the estimable hypothesis is 


that the elements of 8, individually, and as a complete set, 


are estimable. The hypothesis that 8 = C is then estimable. 


he 


Furthermore, any subset of the elements of 8 equal toa 

specified set of constants may be tested by choosing the ap- 

propriate rows of the p x p identity matrix as the matrix F'. 
In developing the likelihood ratio test criterion, we 


will be concerned only with the normal theory case where 


eMVN (0,07). The appropriate likelihood function is then 
cenigme-) = +5 n/2 OXP =e Be 
(Zc) 20 


To develop a test of the hypothesis Hy: F'8 = C versus the 


alternative, H,:F'8 # C, we will utilize the likelihood 


i 


Bacio, 


Ee 
a, 


5 = L() 
1 (2) 





~ > 


L(Q2) is the maximum value of the likelihood function with 

the parameters contained in the (B,0°) p + 1 dimensional 
Space, %, which is unrestricted. L(w) is Similarly the maxi- 
mum value of the likelihood function with the parameter 


Space restricted by H The w space is p - m + l dimen- 


0° 
Sional because values for m independent relationships among 
the elements of 8 are specified by the hypothesis. The maxi- 
mizing values of the parameters in the w space shall be de- 
noted by 8 and 52, while the corresponding values in the 2 
space shall be denoted by 8 and ae 

In the restricted parameter space we desire to maximize 
£(e;B,0°) with respect to o* and 8 subject to the restraint, 
F'p = C. 


Let 
iene so) = log, [£(e78,0°) ]. 
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Lt.teo lows that 


0. =.) (YSx%e) Gay Nn _ 9 
an <4 ee 2 ee 
00 26 2a 


The resulting maximizing value of om for any given 8 is 


52 sul Xa XG ieneeD 
n 


Examining the likelihood function it is noted that the 
maximum value of the likelihood function with respect to B 
with ° = g* = (Y-X8)' (Y-X8)/n occurs when (Y-X8)' (Y-X8) is 
a minimum. However, we must constrain 8 by the relationshin, 
F'8 = C. The problem of finding the value of 8 which maxi- 
mizes the likelihood function within the hypothesis constraint 
can then be formulated as the quadratic program, 

mam (Y—-X6)'" (Yee), 
Subject to: FS =n. 


The derivatives of the Lagrangian of the objective function 


ene ¢ 
oY t ' ' ' ' 
ye 7 7 2Y°X + 2B'X'X = gear F" = 0 (Ame) 
i =e 7 


which are the necessary and sufficient conditions for a mini- 
mization where 2A' is the appropriate vector of Lagrange 
multipliers. We first premultiply eqn. (4.1) by ee 
yielding 

-AY + F'B- AA'X = 0. 
SubsititutingeC,, séseomu con. (42) fern F's8, the solMtienweinor Xx 
is 

A= (AA’)"C - A’'y. 


2) 


Returning to eqn. (4.1) with this value of i, and solving 


EOle 8, yields 


~~ 


PCa on ie CAN vy) xX! xX’y 4 (rt -wtx)o 


= ioSdintay + iv + exTio; (amis) 
where Q is an arbitrary p x l vector. 
Since 
—2%' VADMUx (IX A O=X A, AVX y+ (i= X)O)]—2%1A' [(AKL) 1C-a\‘y]) =a 
-X'Y+X'ATC-X'ATAY+X'Y-X'ACHX'A'A’'Y = 0, 
clearly, eqn. (4.1) is satisfied by 8 and ’. Also 
F'B = R'X'X[X A’C-X "A AY+X Y+ (I-X°X)Q] 
= AA*C-AA AY+AY. 


Since A is an m x n matrix of rank n, 


and eqn. (4.2) is also satisfied. 
Returning to the likelihood function and substituting 
while retaining the general term, B, we find that 
L(w) = max £(e;B,0°) = atexpi-n/2) 
2 (27) L(Y=xXe) (Yona 
Hace C 
Z 


Finding the maximizing values of o° and 8 in the unre- 


stricted parameter space yields the results, 


B = X°¥ + (I-X"x)2z (4.4) 
*2_ (¥-x8)' (y-x8) 
n v 


where Z is an arbitrary p x 1 vector, and G7 is a biased 
estimator of of, Combining these results in the likelihood 


function yields the maximum value, 
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. n/2 a 
L(2) = max £(e;B,0°) 22 a ZZ EWE, 
5269 (27) [(Y-X8) ' (Y-XB) ] 
Be. 
The Likelihood ratio Mee enen 
je ee en De 
1) (Y-XB) ' (Y-XB) 


owe 


where 8 and @ are as presented in equations (4.3) and (4.4) 
respectively. 

To develop our test criterion we will first manipulate 
the quadratic forms of the likelihood ratio to form chi- 
Square random variables. The independence of the quadratic 
forms will then be demonstrated and the test criterion estab- 
lished as a non-central F statistic. 

For any B, 

(Y-xXB) '(Y-X8) = (¥-x8)'(¥-xB) + (6-8) '(X'X) (8-8). 

As a consequence, the likelihood ratio can be written as 


1 my 2 
1+ (8-8) 'X'X (6-8) >) 
(Y-XB) ' (¥-xB) 
Let uS now examine the ratio in the denominator. Using 
the solutions for g and 8, it follows that 
(B-8)x'X (B-B) 


cxxty-xxtatcexxtatay-xxty) | (xxty—-xxtatcexxtatay-xxty) 


HI 


(xx A AY-xXx atc) ' (xxtatay—-xxta’c) 


(Ay-C) ' (Att xxta’) (ay-c). 


<2 
By the definition of a non-central chi-square (y °) 
random variable, if Z\MVN(u,V), then Z'BZvy~“(a,h), where 


q is the rank of BV and A = w'Bu/2, if, and only if, BV is 


al 


idempotent. Then if ((at*xxtaty so] [var (AY-C) ] 1s idempotent, 
(B-B) *x'X(B-B) /o7 has a non-central chi-square distribution. 


Z Z 


Since YYMVN(X8,o I), (AY-C) VMVN (AX8-C,o°AA') and 


+, ti + 
BY = oa Po 
fe) 


We must determine whether, or not, AT'xx*ataa! 1s idempotent. 
It is clear that 
A*'xx* (ATAA') = aA’'xx'(a') = atixxt (xR), 
where R'X'X = AX. Therefore, 
A’'XX"ATAA' = AT'XR = AT'A', 
Since A' is a p X m matrix of rank m, it has a left inverse, 
ae and 
BV = A’'XXA'AA' = I. 

Hence, BV is idempotent and of rank m. We can therefore 
conclude that (ay-c) '(at'xxtaty (ay-c) /o? has the non-central 
chi-square distribution with m degrees of freedom and non- 
centrality parameter, \ = (F'®-C)'(A‘'xx'a‘) (F'B-C) /207. 

The quadratic form, (Y-X8)' (Y-x8) is equal to y' (I-XX") y. 
Since (I-Xx") is idempotent of rank n-r, (yY-X8)' (y-X8) has 
the central chi-square distribution with n-r degrees of free- 


dom. Consequently, our likelihood ratio is 


1 n/2 
ae Tay (4.6) 
G 
4 tantiyytat 2 
where yvy~~(m,(F'@-C)'(A 'XX A )(F'B-C)/2o0°) and the variable 
euy (n-r) . 
In general, two quadratic forms T'DT and T'ET are inde- 


pendent if, and only if, DVE = 0, where TYMVN(u,V). 


ZZ 


The identities, 
Q_RQ2\iyt ee Q + Q 2 
y = (B-B)'X'x(B-B)/o = (¥-X8)XX (Y-X6)/o , 
and 
Ai ee ne + se 2 
C= (Y-X8)'(¥-X8)/o =. (Yossi ee 
are convenient forms for demonstrating the independence of 
Yand cz. Since 
(y-x8) = (I-xx'+xx‘a‘a) Y-xx’a‘c , 


and the variance of Y is Eni. then 


Wact(Y-X8) = a” (DEXMU4LX® AA) CL-XX 4X AA)’ = o- (To eee 


The quantity corresponding to DVE is then 
Tx | 


2 
0 


ex +A A) = (i, 





2 


fx'x 
i- 











as can be easily verified. 

The ratio of the non-central chi-square variable to 
the central chi-square variable in the denominator of our 
likelihood ratio with each variable divided by its degrees 
of freedom has the non-central F distribution with m and n-r 


degrees of freedom and non-centrality parameter, 


h = (P'B-C)'(A*'XX7A’) (P'B-C) . 
A necessary and sufficient condition that A = 0 is that the 
hypothesis is true; that is, F'8-C = 0. In this event, the 
ratio of the quadratic forms divided by their degrees of 
freedom has a central F distribution, 


yin-r) _ (Ay-c)' (A'’xx*a") (ay-c) 
ae (¥-x8) * (y-x8) ii 


Since rejection of the hypothesis is consistent with small 
values of the likelihood ratio and the likelihood ratio is 


monotonic decreasing in [y(n-r)j/{t(m))], the critical region 


23 


for this test is 


(n-r) 
amy > Pm n—e 
where Fn ae (a) is the upper significance point correspond- 


ing to Significance level a. 
As substantiation of our likelihood ratio criterion, we 
shall compare it with commonly used criteria for two types 


of tests in the full rank case. The common form of the full 


hypothesis test criterion for 8 = 8*, Graybill (1) for exam- 
ple, is 
n-p (Y-XB*) (X(X!X)""K") (¥-xB*) . 
p | ' 2 p n-p ') 
(Y-XX Y)' (Y-XX Y) 


It 1s apparent that the denominators are equal. Furthermore, 


1 


II 


(Y-XB*) '(X(X'X) 7X") (Y=XB*) = (Y-XB*)"(XX™) | (XX™) (Y-XB*) 


= (XY-B*) "(X'X) (XTY-B*) 
Therefore the test criteria are equivalent in the full 
hypothesis test. 

To test that s of the elements of 8 are equal to known 
constants, while the other p - s elements are unspecified, 
Graybill (1) uses the subhypothesis criterion, 

Ba (YoK Vy MP (K(K'X) “XK, 1X5 'KS) 2X5") (¥-X, 7, *) > F Fe), 
2 (YX 7] #) | (I-K (X1X) OXY X14) Se 

For the purpose of this full rank test, the elements of the 

general linear model have been partitioned such that 


XxX = (X) Xo), st om (Y4 +Y>) 


and the hypothesis is : cla vy" where ar is an s x l vector 


Or Known Constants. 
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In the generalized inverse formulation this hypothesis 
may be tested by setting C = Y1* and choosing F' of the 
form F' = [I : 0], where the identity matrix is of dimen- 
sion s and the null matrix has dimensions s x(p-s). Asa 
Simplification we shall consider only the full rank case 


where Yq is orthogonal to Yoi that is, X1X, =uOem aGance 


F' = AX = [I 0], we must have 
AX, = I 
AX, = 0. 


By property (11) of Chapter II, it is then clear that 


A= x). Our hypothesis is then F'8 = Yq") and the test 


Griterion is 
(Gee FN. XX Xue 
n=p i, a aal 2 1a 1. 


z > F p 


(y-xB) ' (y-xB) 
Again it is apparent that the denominators are equal. 
The numerators are also equal since 


Pe arn) Oboe iim cereto cnc) arc lower 


=) 


1 


5g a 
(Y-Xj¥,*) (XX “XX. ) (Y-X, 7,*) 


(VaR yg ey Oe ee 


= wily th ' te a a * 
BE SR MEPL OMRON | on yay) 


12) oun sneer + 
(XpYoyt) | (Xp XX" X,) (Xp Yeyy*) 
The equivalence of our test criterion to the commonly 


used criterion in this special case of the subhypothesis 


test is then clear. 
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APPENDIX A 


Estimation and Hypothesis (Pestinge- anus Preation 


Consider a completely randomized experimental design 


model, where 


il 
a 
~ 
NO 
~ 
W 


Y= Te? 7.3 1 
ij cage Bs; 
ee 
We shall suppose that the required normality assumptions 
are justifiable for our statistical model, 
Y = X68 + e 
where 
ev MVN(0,0“I). 


The necessary elements of our statistical model are: 


Observations Design Structure 
1 20 530) 
S u it O50 
2 s Ty Ole Sila 
— = X = 
2 T5 Oo 120 
| 
g I™3] i OmOm 
10 1 00 1 


The rank of X is r = 3. By solution of the equations de- 
fining the generalized inverse, 
bed ee ae 


. | 2a - 1. =) 
x" = 1/8 


=> ieee See | — | 


eae 3 «8 


or 


6 i Z 2 


: ee iO 140 
and (X'X)’ = 1/64 


ZOO 8, 


[2 -10 -10 
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As this is an example of the less-than-full rank case, 


the estimate is not unique. 
solution, 


2° eepeaesaliZ 


D> 
Hl 
mS 

K 
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WAS 
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The mean vector of 8 is 


Bene Xx XG = [T-x)xXI]zZ 


ytetot2ts 


1727T974T3 


6u+2T 


Paes 


1/8 
2u-2T +6T{-2T 


il 3 


Ai 2h -21 +67 3| 


iu 


and the variance-covariance matrix 


Tee 


Var (8) = ae ee = Baye 


Furthermore, the unbiased estimate 
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The estimator has the general 


2-2 -2 -2 
=2 20202 
Z 
-2 2 2 2 
[1-2 2 2 2| 
2720 2 tee 


~22,+225+22 +22, 


~22,+225+22,+22) 


|-22)+225+22,+22, 


a 


= 4/3 





The estimator of 8 in the less-than-full rank case by 
itself does not tell us anything about the individual com- 
ponents of 8. However, consider the estimate of a function 
of parameters, where 

Ri ciel Omuly ey ae 
; 7 Zee 4 


A = REX = 9ag2ely/2e207/2 2) 0 | 





Weei/2 1/2 “7 ae 


. 


and 
F' = AX = ; Pi) or 
01 1-2 


\ 


The estimator of the estimable matrix function, F'8, is 


rac ee 0 
eB = rxty = ([,9] 


With mean and variance-covariance matrix 


E(F'®) = F'8 = Ken 


Wise (id lee o°F'IF = 2 Oa 
0 6 


It is commonly desired to test the equality of all the Tay 
the treatment effects. In this event, the corresponding 


estimable hypothesis is 


OSL 0 Ge 0 
F'6 = B= | > * =| |. 
Oil 1 -2 eee “27, 0 


Z 
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For this estimable hypothesis, the generalized inverse of 
A' is 
ee ee 2 1 2 0 0 
AT! = 
iow 6 1/651 /6 =-1/3 -1/3 
mie testeStacwsLic, 


tat) (ay-c) 


-- - = 24.5. 
(Y-XB) ' (Y-XB) 


[n=r Y= | aiets (AY-C) ' (AT ' XX 
| mj} co \m 


The critical region for this test is 
See eG = 
| = | ; > Fo 36-95) or ote 


We must therefore reject the hypothesis, H 
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